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INTRODUCTION 

A  logically  consistent  market  share  model  should  predict 
market  shares  that  are  between  zero  and  one,  and  sum  to  one. 

Few  authors  have  worried  about  this  type  of  problem  in 
empirical  studies,  mainly  because  of  the  usual  interest  in  a 
particular  brand.   It  is  then  implicitly  assumed  that  if  pre- 
dicted market  share  for  that  brand  is  MS  ,  the  other  firms  com- 
bined will  get  (1  -  MS  ) .   This  may  seem  pretty  obvious,  yet  it 
is  not.   If  we  were  to  estimate  the  market  share  response  functions 
for  the  other  brands  as  well,  we  would  often  find  that  the  sum 
of  the  market  shares  is  not  one.   The  problem  becomes  particularly 
apparent  to  the  reader  when  the  response  functions  for  all  brands 
are  estimated  simultaneously.   We  will  therefore  make  constant 
reference  to  an  article  by  Neil  E.  Beckwith  in  a  recent  issue 
of  JMR  [2],  in  which  he  reported  an  application  of  the  joint 
generalized  least  squares  method  (joint  GLS)  [15]  to  the  estima- 
tion of  linear  market  share  response  functions  of  various  com- 
peting brands. 

We  will  first  examine  the  restrictions  on  the  explanatory 
variables  and  the  parameters  which  are  implied  by  the  market 
share  addltivity  constraLnt.   The  restrictions  on  the  dis- 
turbance terms  are  discussed  next.   In  the  final  section  we 
conclude  that  in  order  for  market  share  functions  to  be  logically 
consistent,  their  functional  form  almost  invariably  should  be 
intrinsically  nonlinear. 

The  appendix  contains  some  additional  comments  specific 
to  Beckwith' s  article. 

0371127 


IMPLICATIONS  OF  THE  ADDITIVITY  CONSTRAINT 

Zellner's  method  will  usually  lead  to  considerable  gains  in  efficiency 
of  the  estimators  provided  that  disturbances  of  the  different  equations  are 
contemporaneously  highly  correlated.  ([15],  pp.  353-4).   This  condition  is 
satisfied  in  the  case  of  market  share  response  functions  for  various  brands 
competing  in  an  oligopolistic  market  since,  for  example,  a  positive  distur- 
bance for  one  brand  in  a  particular  time  period  implies  that  at  least  one 
other  brand  will  have  a  negative  disturbance  in  that  same  period.   Thus  it 
would  appear  that  Zellner's  method  may  be  profitably  applied  for  joint 
estimation  of  a  set  of  market  share  response  functions. 

However,  Zellner's  method  does  not  guarantee  that  the  sum  of  the 
market  shares  predicted  from  the  estimated  functions  will  add  up  to  one. 
Yet  such  a  constraint  is  necessary  if  a  logically  consistent  model  is  desired, 
This  seems  particularly  important  when  the  estimated  parameters  are  used  to 
determine  the  optimal  advertising  expenditures  for  the  individual  brands, 
as  in  Beckwlth's  article. 

First  we  will  examine  the  implications  of  imposing  a  sum  constraint 
on  the  dependent  variable  in  a  set  of  linear  functions.   We  will  then 

apply  the  results  to  the  specific  model  proposed  by  Beckwlth.   Consider  the 

2 
following  linear  function  (e.g.  market  share  function  for  brand  i) : 

^^^     ^it  =  ^il  ^ilt  +  ^12  \2t  ^  •  •  •  ^  ^ip  V'  '°'^  -I"--  ." 


Let 


"^  =    ' 


"Uf  "Zjt hit "njt*-  f"  J-1.-".  P- 


B-   =   (e,.,  B^j.  ....  B^.)  ,  for  j  =  1.  .  .  .  ,  p 

Let  j  =  1  correspond  to  the  constant  term  in  the  regression,  that  is, 

X^^^     =1    for  i  =  1,  .  .  .  ,  n 
t  =  1,  .  .  .  ,  T 

Finally  let  u  be  a  nxl  column  vector  of  ones , 

u'   =   (1,  1,  ...  ,  1) 

3 
Theorem:   The  necessary  and  sufficient  conditions  for  a  linear  model, 

P 
y.   =   y    6,    X, ,  ,  to  predict  sum-constrained  dependent 
^it      L^  ij   ijt'    ^ 

variables,  i.e.:   u'y  =  r,  are  the  following: 

1)   the  explanatory  variables  should  be  sum- cons trained 


i.e.:   u'  X    =   c, ,  for  all  i; 
Jt      j' 

2)  the  coefficients,  excluding  the  constant  term,  should  be  equal 
across  equations,  i.e.:   3^  .  =  6'',  for  all  i  and  all  j  ^  1; 

3)  u'  B^  +  y         B^  c,  =  r.^ 

^    3^  ^ 

Sufficient  Condition:    If  u'X.^  =  c.   for  all  j,  and  B . .  =  B^  for  all  i  and 

Jt    J  -^       ij 

P    . 
for  i  =  2,  .  .  .  ,  p,  and  if  u'B-,  +   '^~  B^c.  =  r,  it  follows  that  u'y   =  r 

1    ^2    J 


Proof:   First  we  take  the  sum  of  the  dependent  variables 


(2)      u'y^   =  I       X-^3 
-,=1   -J 


X   is  a  vector  of  on is ,  and  thus  X^ '  6,  =  u'  3,.   For  all 
j  not  equal  to  one  6.  is  a  vector  of  constants  3'',   i.e.  8',  =  (3-^,  R-^  ...  B^) 


and  therefore  we  can  write 

X'   3.  =  3^  u'  X.^. 
Jt   J  jt 

With  u'X.   =  c.,  we  obtain  after  substitution  in  (2), 

P      A 

(3)     u'y,  =  u'B,  +  y       3^c.. 

With  the  right  hand  side  of  (3)  equal  to  r,  the  proof  of  sufficiency  is 
complete. 

Necessary  Condition:    Suppose  now  that  the  dependent  variables  are  sum 

constrained,  i.e.  u'y   =  r.   To  be  shown  is  that  u'y   =  r  implies  u'X_  =  c. 
^t  ''t  "^  Jt    J 


P 

Y 


for  all  i  ,  3^.  =  3^  for  all  i  and  for  1=2,  .  .  .  ,  p ,  and  u'6,  +  )     B-'c.  =  r. 
ij  1    ."^'2    J 


Proof:   1)   u'X.   =  c.  for  all  j 
Jt    J 


Consider  two  sets  of  admissible  values  for  the  independent  variables  X.  , 

^  Jt' 


and  X*  ,  that  is 
Jt 


X,    3.  ,  and 
Jt   J 


P       *' 

(5)      r  =   f   X.^   3,. 


Subtracting    (5)    from   (4) ,   we  obtain 


(6)     0  = 


y    (X    -  X  )  6.. 
M     jt     jt    J 


Without  loss  of  generality,  we  can  consider  the  following 
X°j.  /  X*^   for  j  =  k  /  1 

^°t  =  ^*t  '-^   J  *   ^- 
Equation  (6)  then  simplifies  to 

1=1 

Without  loss  of  generality  we  can  assume  6.,  #  0   only  for  i  =  s.    Suppose 

now  that  X.   is  not  sum  constrained.   The  following  could  then  be  admissible 
Jt  ^ 


\t 


^ikt   =  4t   fo-  i  '^  - 


^Ikt  ^     \kt   for  i  =  - 


Equation  (7)  now  becomes 

0   =   6  ,   (X  .   -  X*  .  ). 
sk    skt     skt 

°  * 

Since  B  ,  ?'  0,  and  X  ,   ?^  X  ,  ,  this  is  a  contradiction  and  hence 
sk  skt    skt 


u'Xj^   =  Cj    for  j  =  2,  . 


Note  that  for  i=l,  u'X.   is  by  definition  constant  and  equal  to  n. 
•^       Jt     ^ 


2)   3,.   =  B^   for  j=2, 
ij 


Consider  the  following  vectors  for  the  explanatory  variables 


X^^  ,t  x^^   for  j  =  k  /  1 

X.;  =  X*^   for  j  ^  k 

Assume  that  the  X   vectors  satisfy  the  sum  constraint  conditions 
derived  in  part  (1).   Let  X,^  be  defined  as  follows 


in  part  (1).   Let  X   be 


^ikt  =  ^ikt   ^"'^  ^  '^  ^'^ 


(8)     X.  =  X.+A,  with  A  /  0 
wkt      wkt 

*        ° 

vkt      vkt 


Thus,  the  X.   vectors  also  satisfy  the  sum  constraint  conditions.   Now 
Jt 

assume  that 

wk    vk* 
Substituting  (8)  into  (7)  we  find 

°   =   ^  <^k  -  ^k> 

With  A  5^  0,  assuming  3  ,  j^  6  ,  results  in  a  contradiction.   Thus 
vk    wk 

3.  .  =  B^  for  j=2,  .  .  .  ,  p 

P    -i 
3)      u'Bi  +  t       B-lc.  =  r 


Substituting  the  results  of  parts  (1)  and  (2)  into  equation  (2),  we  obtain 


P         A 

Given  u'y   =  r,  the  proof  is  complete. 

Let  us  now  apply  the  theorem  to  the  specific  model  proposed  by  Beckwith. 
Market  share,  MS.    is  a  function  of  lagged  market  share,  MS   ^^  and 
advertising  share  AS    : 

(9)  MS^^^   =   A^MS^^  ^_^+  Y^  AS.^^+  t.^^. 

The  constraint  on  the  dependent  variable  is 

lil  ^^,t  =  '' 
By  definition,  the  explanatory  variables  are  also  sum  constrained, 


n 


^  "^i  t-1  "  ^         and     II  AS^  ^  =  1. 


i=l 


Following   the  theorem,  however,  we  should  also  have 
X   =  A   for  all  i, 

(10)     Y^   =   Y   for  all  i, 
A  +  Y   =   1- 

The  conditions  on  A  and  y  are  not  satisfied  in  Beckwith 's  article. 

For  example,  considering  the  IZEF  estimates  in  his  Table  2,  the  range  of 

values  for  A.  is  from  0.9814  to  1.0068,  for  y-  from  -0.0030  to  0.0133,  and 

for  A.  +  Y.  from  0.9891  to  1.0136.   Clearly,  the  various  A.  estimates  are 
11  i 
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(J>  =  Q  B  I   note  however  that  0.   is  singular  since  if  the  market  shares  sum 

to  one,    r   e.    =0   (see  [9],  p.  1203);  as  a  result  one  of  the  brands 
±=1     ^'"^ 

observations  ought  to  be  deleted  from  the  set,  so  that  the  method  should  be 

applied  to  n-1  brands. 


RESTRICTIONS  ON  THE  DISTURBANCE  TERMS 

Market  share  is  a  quantity  obviously  confined  to  the  interval  from 
zero  to  one.   This  natural  restriction  limits  the  set  of  distributions  capable 
of  describing  the  behavior  of  the  disturbances  (see  [9],  p.  1205)  since 

^  1  ^^S^   1  ^  implies : 
1 ,  t 

-1  <_  -(A-MS.  ^  .  +  Y-AS.  ^)  1  e.  .1  1-(X-MS      +  yAS   J  1  1 

which  indicates  that  the  normality  assumption  is  irrelevant. 

As  pointed  out  by  Theil  ([13],  pp.  629  et  seq.),  this  aspect  is 
awkward  in  regression-type  situations  and  therefore  a  monotonic  transformation 
is  usually  applied  to  this  particular  kind  of  dependent  variable  so  that 
the  new  dependent  variable  constructed  is  then  defined  over  the  [-°°    ,   +  "=] 
range.   In  fact,  many  transformations  have  this  property.   The  logit  is  one 
of  them  and  in  our  case  the  first  step  amounts  to  defining  a  new  variable 


"■i,t      l-MS^^^  ' 

a  quantity  with  range  [0,  <»]  since  when  MS    =  0,  m   ,.  =  ^  ^nd  when 

"^i,t  =  1'  '"i,t  =i=   ^- 

The  log  of  m    is  then  defined  over  the  [-  °o,  +  «>]  -  interval  since 
^  >  t 
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(log  0)  =  -  oo  and  (log  +  oo  )  =  +  oo.   Adjusting  the  original  model 
accordingly  we  obtain  the  equation  to  estimate.   In  Beckwith's  case  we  come 
up  with: 


log 


^•^^i.t-l^^-^^i,t 


log 


MS, 
i,t 


1-MS. 
1,  t 


log  (X-MS^  ^  +  Y-AS^  ^)  -  log  (1-AMS^  ^_^  -  y'AS.  ^) 


which  has  now  an  intrinsically  nonlinear  form. 

Another  example  of  the  same  transformation  is  presented  in  the  next 
section. 

INTRINSICALLY  NONLINEAR  FORMS:   A  NECESSITY  ? 


Now  one  might  argue  that  the  requirements  A   =  A  and  Y.  =  Y  for  all 
i,  are  not  very  appealing.   Indeed  they  do  not  allow  for  differentiation 
between  brands,  in  the  way  market  shares  respond  to  advertising  decisions. 
We  come  then  to  the  conclusion  that  if  we  want  a  logically  consistent  model 
which  allows  for  differences  in  the  response  parameters  between  the  various 
brands,  the  model  structure  should  simply  be  nonlinear. 

We  may  wonder  at  this  point  whether  such  a  structure  may  easily  be 
defined.   Usually  when  market  share  functions  are  nonlinear,  they  are  of  such 
a  form  as  to  become  linear  upon  transformation.   For  example,  multiplicative 
functions  are  widely  used  in  empirical  work.   The  multiplicative  equivalent 
of  the  expected  value  of  Beckwith's  market  share  function  is, 

(11)     MS^^^   =   MS^^^_J   AS^j^ 
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For  a  multiplicative  response  function  such  as  (11)  it  is  not  possible  to 
determine  meaningful  restrictions  on  the  explanatory  variables  and  on  the 
parameters,  which  will  guarantee  that  market  shares  add  up  to  one.  More 
complex  functions  are  generally  needed. 

For  example,  with  advertising  and  lagged  market  share  as  the  only 
explanatory  variables,  let 

(12)     MS.  ^   =   A  MS,  ^  ,  +  (1-X) 


i,t         i,t-l    ^    '    n   Y     ' 

k=l  ^'^ 

where  A    is  brand  i's  advertising  expenditures.   It  is  readily  seen  that 
1 ,  t 

the  sum  of  the  market  shares  as  defined  in  (12)  is  one,  and  this  without 
specific  constraints  on  the  y  parameters.   Unfortunately  such  formulations 
have  some  problems  of  their  own.   Determining  the  values  of  the  parameters 
is  less  straightforward  than  in  the  linear  case,  although  many  nonlinear 
programming  procedures  are  now  available  which  make  nonlinear  estimation 
much  less  of  a  problem.   However,  the  statistical  properties  of  nonlinear 
estimators   are  much  weaker.   These  are  probably  the  main  reasons  why  in 
empirical  work  one  has  usually  avoided  intrinsically  nonlinear  functional 
forms.   Only  a  few  applications  of  nonlinear  estimation  techniques  were  re- 
ported in  the  marketing  literature,  among  them:   Glen  Urban' s  product  line 
study  [lA]  and  Kuehn,  Mcguire  and  Weiss'  estimation  of  Kuehn's  model  [6]. 

Some  readers  may  now  ask  themselves  whether  it  is  possible  to  design 
any  logically  consistent  market  share  model  which  may  be  linearized.   The 
answer  is  Yes  but  .  .  .  Rather  "heroic"  assumptions  have  to  be  made  and 
non  trivial  transformations  have  to  be  devised. 
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Let  us  illustrate  this  point  further.   Suppose  the  consumption  pattern 
on  a  two-brand  market  may  be  described  by  a  Markov-type  matrix: 

T+1 

r  .         .    " 


where  A   is  the  proportion  of  consumers  loyal  to  brand  i, 

a   is  the  proportion  of  consumers  switching  from  brand  c  to  brand  i; 

X  and  Oj  are  defined  similarly  but  refer  to  brand  c. 
c      i 

We  can  define  the  impact  of  the  competitive  forces  on  the  consumption 
habits  by  making  \    ,   a    ,    A  and  o   explicit  functions  of  the  two  brands 
marketing-mix.   Considering  only  the  effect  of  advertising  (A),  we  could 
postulate: 


A.  ^  =  1-e 


-a. A,  . 
1  i,t 


.  r 
-a  A 
c  c,t 


c,t 


(13) 


-a. A.  ^ 
.   1  i,t 


i,t 


A   ^   =   1-e 
c,t 


-a  A 
c  c,t 


where : 


::   a/^   =  A,   /A  ^   and  A  ^.   =   A   /A 

i,t      i,t   c,t        c ,t      c,t   l,t 

from  which  we  deduce  the  nonlinear  market  share  equation: 


(14) 


and  at  equilibrium,  assuming  no  change  in  the  brands  advertising  budgets 
until  equilibrium  is  reached,  then: 


14 


-a  A 
c  c 


(15)    E  IMS.^JA^.  AJ   =  JITTV 


-a, A.      -a  A 
i  i   ,     c  c 
e       +  e 


If  we  are  willing  to  assume  that  consumption  patterns  are  sufficiently 
stable,  which  is  very  often  the  case  when  the  primary  demand  reaches  the 
maturity  stage  and  when  product  innovations  do  not  occur,  and  provided  that 
the  unit  time  period  is  sufficiently  long,  then  the  equilibrium  may  be  achieved 
within  period  t  and  we  may  retain  equation  (15)  .   Applying  the  logit 
transformation  to  it  we  obtain: 


log 


^Si,t  ^ 


1-MS. 


log 


i.t 


-a  A  ""^^ 
c  c,t 


-"i^i,t 


-ct  A 


log  (e 


c  c,t.    T    ,        i  i  ,t. 
'  )  -  log  (e     '  ) 


which  means  that  we  are  now  able  to  apply  the  OLS  technique  to 


r  MS      ] 


1-MS 


The  error  term  e,    is  thus  defined  over  the  range  [-«=,+  «=]  and  if 

Q 

we  consider  MS.    as  a  binomial  frequency   then  e    is  asymptotically 
X  ,  t  1  >  t 

normally  distributed  ([13],  p.  636). 

It  should  be  clear  that  the  linearization  of  equation  (15)  is  critically 
conditional  upon  the  exponential  specification  chosen  for  X^,  o^,  X^   and  o^. 
It  should  also  be  obvious  that  a  lot  of  very  restrictive  assumptions  have  to 
be  made  in  order  to  select  equation  (15)  instead  of  the  intrinsically  nonlinear 
form  (14). 
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CONCLUSION 

In  this  paper  we  derived  restrictions  on  explanatory  variables,  para- 
meters, and  disturbances  implied  by  an  additivity  constraint  on  the  depen- 
dent variable,  for  example,  market  share.   These  restrictions  are  such  that 
linear  market  share  structures  do  not  allow  for  differences  in  the  response 
parameters  for  various  brands.   We  argued  that  to  be  logically  consistent 
market  share  models  should  generally  be  intrinsically  nonlinear. 

This  does  not  mean,  however,  that  in  future  empirical  work,  intrin- 
sically linear  market  share  functions  should  no  longer  be  used.   We  should 
recognize  the  fact  that  linear  models  are  easier  and  less  expensive  to 
estimate,  the  statistical  results  are  believed  to  be  more  straightforward 
to  interpret,  and  when  used  as  aids  in  decision  making,  optimal  allocation 
rules  are  simpler  to  derive  and  to  apply.   Predicted  market  share  values  from 
an  intrinsically  linear  model  may  often  provide  sufficiently  close  approxi- 
mations.  Moreover,  when  estimated  parameters  are  used  as  prior  estimates, 
which  are  subsequently  adjusted  by  managerial  judgment,  such  an  approximation 
is  perhaps  even  more  acceptable.   For  an  interesting  discussion  on  these 
issues  we  refer  to  Lambin  [7,  pp.  120-21], 
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APPENDIX 
MISCELLANEOUS  REMARKS  ON  BECKWITH' S  APPROACH 

Suppose  the  linear  market  share  model  were  acceptable,  i.e.  the 
constraints  defined  above  are  satisfied,  other  specification  tests  are  still 
to  be  performed.   Although  Beckwith  does  not  completely  neglect  this  aspect, 
the  tests  he  carried  through  ([2],  p.  171)  are  not  very  relevant. 

Including  additional  lagged  dependent  variables  (MS     »  and  MS   j._-j) 

should  not  throw  much  light  on  the  model  specification.   On  the  contrary, 
examining  alternative  lag  distributions,  e.g.  the  Pascal  distribution 
([13],  pp.  264-267),  or  checking  whether  the  geometric  decline  starts  at  the 
first  lag  or  at  the  k-th  one  are  questions  to  be  dealt  with  seriously.   Thus, 
for  example,  when  the  time  period  is  short-one  month,  in  Beckwith 's  case  - 

it  may  be  unrealistic  to  assume  that  the  successive  advertising  coefficients 

9 
decline  immediately. 

Moreover  Beckwith  does  not  pay  any  attention  to  the  problem  of 

autocorrelation  which  may  prevail  among  the  disturbances  of  each  brand 

equation  and  may  indicate  misspecif ication  or  measurement  errors  in  the 

variables  (see  [8],  p.  50A).   In  a  working  paper  upon  which  Beckwith 's  article 

is  based,  however,  the  residuals  of  the  brand  market-share  model  are 

tabulated  and  the  Durbin-Watson  statistics  d  computed  (see  [1],  p.  23).   The 

latter  are  of  course  biased  due  to  the  presence  of  the  lagged  dependent 

variable  on  the  right-hand  side  of  the  equations.    But  provided  that  these 

statistics  were  adjusted  for  gaps  in  the  data  set   ,  we  can  use  them  to 

derive  the  new  DURBIN  statistic  h,  which  in  the  case  of  independence  of  the 

disturbances  is  asymptotically  distributed  as  a  standardized  normal  variate 

(see  [3],  p.  419). 
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1/2 
h  =  a   i 

1  -  n.v(X) 

in  which  a  is  the  autocorrelation  coefficient  of  residuals, 

n  is  the  number  of  observations , 

and  V(X)  is  the  estimate  of  the  variance  of  the  lagged  dependent 
variable  coefficient  given  by  least-squares  analysis. 

The  h  statistics  computed  on  Beckwith's  IZEF  brand  market-share 

12 
model   are  reported  in  Table  1.   Brand  A  and  B's  h  statistics  are  greater 

than  the  critical  10%  value  and  may  indicate  autocorrelation. 

TABLE  1 


DURBIN  STATISTICS 

Brand 

a 

h 

A 

-0.235 

-1.3904 

B 

0.260 

1.5380 

C 

0.135 

0.7989 

D 

-0.085 

-0.5028 

E 

-0.170 

-1.0059 

*Critical  values  at  the  10%  and  5% 

levels,  under  H  :  a=o,  are  1.282  and 
'        o 

1.645,  respectively. 

One  should  be  careful,  however,  in  using  the  h  statistics  for  testing 
autocorrelation  of  disturbances  in  a  Koyck  model.   In  fact,  the  h  statistics 
tests  the  non-autocorrelation  against  the  alternative  hypothesis  of  a 
first-order  Markov  dependence  scheme.   If  we  now  assume  that  disturbances 
(e    )  in  the  original  model: 


18 


MS 


i,t        J:-  i.t-T    1,1 


are  Independent, when  we  apply  Koyck's  transformation  the  resulting  distur- 
bances (e.   )  get  correlated  and  their  autocorrelation  scheme  does  not 
i,t   ^ 

follow  a  first-order  Markov  process,  since  e.  ^  =  e.  ^  -A-  e.  ^  , ,  in  which 

^  i,t    i,t      i,t-l 

case  the  h  statistics  is  irrelevant. 

Griliches  ([4],  pp.  41-42)  liscusses  an  instrumental  variable 
approach  which  does  not  depend  on  a  particular  structure  of  the  disturbances 
and  should  yield  a  consistent  estimator  of  X.   Applied  to  Beckwith's  model 
this  two-stage  approach  amounts  to  first  estimate  the  form: 


(17)   MS.     =  b  •  AS,  ^  +  b„-  AS.  ^  T  +  b„-  AS .  ^  _  +  ...  +  v.  ^, 
i,t      1    i,t    2    i,t-l    3    i,t-2  i,t 


including  additional  lagged  AS.      terms  as  long  as  they  contribute  to  the 

explanation  of  the  market  share,  and  then  re-estimate  the  usual  Koyck- 

transformed  equation  in  which  MS   ._   predicted  by  (17)  has  been  substituted 

1 ,  t  i 

for  MS.  ^  , ,  that  is 
i,t-l 

^^«>   "^i,t  =   ^-  ^i,t-l^^'  ^^i.t^  ^i,t^^-  ^,t-l 


The  estimated  A  and  >  are  then  consistent  provided  that  AS    is  a  true 
exogenous  variable  (at  least  with  respect  to  MS    ). 

The  above  discussion  indicates  that  Beckwith  has  paid  little  atten- 
tion to  the  problems  of  model  specification,  disturbance  autocorrelation  and 

13 

estimates  consistency. 
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On  the  other  hand,  he  focused  considerably  on  the  testing  for  nor- 
mality of  the  errors  distribution,  while  as  he  stated  it  himself  this 
assumption  has  only  a  minor  importance:   "...  Most  of  the  properties  of 
this  estimator  (IZEF)  are  not  dependent  upon  assumed  normality"  (see  [2], 
p.  169).   Furthermore  as  we  noted  it  earlier,  testing  the  normality 
assumptions  is  irrelevant  in  this  case. 

More  surprising  is  his  testing  of  residuals  zero  mean  (see  [2],  p.  171), 

since  it  is  not  true,  in  general,  that: 

either         the  OLS  residuals  have  zero  sum  when  the  equation  contains 
no  constant  term  (see  [13],  p.  40); 

or  the  GLS  and  joint  GLS  residuals  have  zero  sum  even  when  the 

equation  contains  a  constant  term,  due  to  the  use  of  the 
covariance  matrix  of  disturbances  in  the  estimation  method 
(see  [13],  p.  239) 

The  market-share  model  specification  and  estimation  is  not  the  only 
matter  of  concern  for  us  in  Beckwith's  paper.  The  normative  inferences  he 
deduced  are  also  subject  to  caution. 

The  author  omitted  to  mention  that  the  optimal  advertising  level  he 

proposed  (see  [2],  p.  173)  was  derived  under  the  following  restrictive 

assumptions: 

1)      Cournot  behavior  of  the  decision-makers,  i.e.  each  firm  determines 
its  optimal  level  regardless  of  the  possible  reactions  of 
its  competitors: 

dA°  ^ 

* 

dA, 
i,t 

where  A    is  the  amount  spent  on  advertising  by  brand  i's 
competitors. 
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2)      Stable  primary  demand,  i.e.  M  =  a  constant,  while  in  fact 
Beckwith  himself  wrote:   "Consumption  was  seasonal, 
generally  increasing  over  the  period  studied,  and  believed 
to  be  constant  per  capita."   (see[2],  p.  169)   -so  that  in 
deriving  the  optimal  level  for  advertising  expenditures 
the  fact  that  M  changes  with  the  increase  in  the  number  of 
potential  consumers  should  be  taken  into  account.   So  for 
example,  if  the  potential  market  grows  at  a  rate  r,  a  month, 
the  industry  demand  can  be  represented  by: 

M   =  M   (1  +  r)""*^ 
a      t 

and  the  derived  optimal  advertising  is: 

\  ^i  '"i  ^^  -^  ^i>     n   1  ^'^  o    I 

(1  +  d.)-(l  +  r)A.    i,t         i,t  1 


A,     =  MAX 
i.t 


where  d.  is  the  monthly  cost  of  capital  and  m.  the  incremental 
unit  profit  margin. 


Needless  to  say  that  the  constraint  (1  +  r)  A.  <  (1+d.)  is  to  be 
■^  11 

met  if  the  above  formula  is  to  make  sense.   Given  Beckwith's  estimates  and 
data  this  amounts  to: 

1  +  d. 

X 

using  the  largest  estimate  of  A  ,  i.e.:   0.9979  and  given  d   =  0.008,  r 
should  be  less  than  0.010121. 
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Footnotes 

By  '  itrinsically  nonlinear  forms  we  mean  equations  which  cannot 
be  linearized. 

2 
In  fact,  we  should  add  a  disturbance  term  e.   .   Since  we  will  be 

dealing  with  expectations  e    will  be  deleted.    ' 

3 

A  similar  theorem  was  derived  by  Schmalensee  ([12],  pp.  47-49).  In 
his  demonstration,  he  makes  use  of  the  properties  of  the  row  kernel  of  the 
matrix.   Our  proof  is  more  concise  and  generalizes  Schmalensee's  theorem. 

4 
Note  that  r  could  be  time-dependent,  that  is,  the  constraint  on  y 

is  u'y   =  r  .   For  example,  if  the  dependent  variable  were  brand  sales, 

q.   ,  their  sum  should  add  up  to  industry  sales,  Q  ,  which  may  be  time 

dependent.   In  the  formulation  of  the  theorem  the  c   are  then  replaced  by 

c.   ,   The  proof  is  not  affected  except  that  r  becomes  r  and  c.  becomes 

c . 

If  S  .,  were  zero  for  all  i  and  k,  none  of  the  X's  would  be  explana- 
tory variables.  If  at  least  one  of  the  X's  is  a  true  explanatory  variable, 
s  and  k  can  always  be  chosen  such  that  6  j^  ii^  o. 

(\  n  n 

For  example  in  (11),  by  definition  ^  MS.   _   =  1,  and  )]     AS    =  1. 

i=l     '  i=l     ' 

With  A.  =  o,  and  y      =   1   for  all  i,  the  sum  of  the  expected  market  shares  is 
one.   However,  these  restrictions  are,  at  least  a  priori,  not  very  laeaningful. 

Provided  of  course  that  no  change  in  the  advertising  strategy  of  both 
firms  occurs  during  period  t.   Similar  assumptions  were  implicitly  made  by 
Hartung  and  Fisher  [5].   We  are  also  presently  investigating  similar  forms  as 
well  as  nonlinear  structures  applied  to  the  appraisal  of  distribution  network 
performance  [11]  . 

g 
An  acceptable  assumption  since  we  consider  the  MS.    as  conditional 
equilibrium  market  shares.  ' 

9 
For  a  study  of  some  of  these  problems,  see  [10]. 
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The  residuals  are  less  correlated  than  the  disturbances  and  therefore 
"it  is  not  ■'ossible  to  make  a  correct  assessment  of  the  correlation  of  the 
errors  usi!  g  the  autocorrelation  coefficients  of  the  residuals"  (see  [8], 
p.  556). 

Since  Beckwith  eliminated  13  observations  corresponding  to  "unusual 
transients"  (see  [2],  p  170),  13  gaps  should  be  considered  in  the  computation 
of  the  Durbin-Watson  statistic   d 


T  -1 

t  S  t  K+1  -  ^J 


d  = 


13 

I 

g=l 


E 

T  =  t 


where  [t  ,  T  ]  delimits  the  observations  between  gaps;  the  autocorrelation 

g    g 
coefficient  is  then  deduced  via: 

^   1   d 
a  =  1-  2  . 

12 

Although  the  residuals  of  the  equations  estimated  separately  by  OLS 

are  not  available  we  can  use  the  IZEF  residuals  since  we  are  only  interested 

in  their  degree  of  autocorrelation  (while  the  joint  GLS  method  only  adjusts 

for  contemporaneous  covariances  across  equations). 


13 

"Correlation  of  the  errors  may  greatly  bias  estimation  of  the  para- 
meters of  an  autoregressive  model,  when  this  estimation  is  carried  out  by 
the  method  of  least-squares"   ([8],  p.  558). 
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